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Abstract. - The problem of sending the maximum amount of flow q between two arbitrary 
nodes s and t of complex networks along links with unit capacity is studied, which is equivalent 
to determining the number of link-disjoint paths between s and t. The average of q over all 
node pairs with smaller degree fe m i n is {q)k min — cfcmm for large fc m in with c a constant implying 
that the statistics of q is related to the degree distribution of the network. The disjoint paths 
between hub nodes are found to be distributed among the links belonging to the same edge- 
biconnected component, and q can be estimated by the number of pairs of edge-biconnected 
links incident to the start and terminal node. The relative size of the giant edge-biconnected 
component of a network approximates to the coefficient c. The applicability of our results to 
real world networks is tested for the Internet at the autonomous system level. 



The analysis of network structures of complex systems has turned out to be extremely 
useful in exploring their large-scale organization and unveiling their evolutionary origin [1-4] . 
Anomalous features found by their graph-theoretical analyses [5-7] are the fingerprints of their 
hidden organization principles as well as the key to predicting their behaviors. Real- world 
networks usually serve a particular purpose which frequently can be expressed in terms of 
flow problems, in particular in the context of transport between the nodes along links (edges) 
with restricted capacities. For instance if one is interested in the maximum possible flow that 
can be sent from one node to another, one has to solve a maximum flow problem, for which 
polynomial algorithms exist. It also arises per se in a variety of situations such as assignment 
and scheduling problems [8] . In real- world networks one is often interested in the question of 
how many link-disjoint paths do exist between a particular node pair - for instance to establish 
as many independent transportation routes as possible between two nodes in the occasion of 
a sudden demand (like in the event of a natural catastrophe or in a military context). This 
problem is again identical to the maximum flow problem between those two nodes in the same 
network with unit-capacity links. 

Properties of complex networks related to transport are inherently connected to their 
topological structure, in particular to various aspects of connectedness. The existence of a 
single path from one node to another is guaranteed if both nodes belong to the same connected 
component of the network. Moreover, nodes that belong to a single biconnected component 
have two disjoint paths between them. We will show in this paper that the arrangement of 
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the biconnected components of a given network is the essential determinant for the number 
of disjoint paths between two nodes of large degree, the hubs (which is non-trivial as soon as 
more than two disjoint paths between nodes exist). In this way we also establish a specific 
connection between the topological structure and the flow properties of complex networks. 

To be specific we consider undirected networks of heterogeneous connectivity pattern and 
calculate the number of link-disjoint paths between two nodes s and t. We assign a unit flow 
capacity to each link and compute numerically the maximum flow, denoted by q s j, between s 
and t [9]. Since only one unit of flow can be sent along each link, q Syt is a non-negative integer 
and is equal to the number of link-disjoint paths connecting s and t. 

We will show that on average the maximum flow between two nodes is proportional to the 
minimum of their degree (= number of incident links), with a proportionality constant c that is 
asymptotically independent of this degree. As a consequence we obtain that in scale-free (SF) 
networks, a class of networks where the degree k follows a power-law distribution Pd{k) ~ A; -7 
asymptotically, the maximum flow obeys a power-law distribution with the exponent 27 — 1 
while in completely random networks it follows a Poisson distribution. Moreover we will 
demonstrate that the coefficient c reflects the effect of the global connectivity pattern on 
transport properties such that the constant c varies depending on the total number of links 
as well as the degree exponent. Finally we will show that the edge-biconnectedness is crucial 
in the transport among hub nodes and that the maximum flow between hub nodes can be 
estimated by the number of pairs of edge-biconnected links incident to the start and terminal 
node. This implies also that the coefficient c is related to the relative size of the giant edge- 
biconnected component. 

The degree of a node i is the number of incident links and is denoted as hi. For illustrational 
purposes we first consider D > 1-dimensional regular lattices where all nodes have the same 
degree 2D except for those at the boundary. Due to the homogeneous connectivity pattern, 
the maximum flow or the number of link-disjoint paths between two distinct nodes s and t is 
2D unless either node is at the boundary. If s or t is at the boundary, the maximum flow is 
given by the smaller value of k s and k t . Therefore q Si t in regular lattices can be expressed as 

q S ,t = fcmin, (1) 

where k mm = min{£; s , kt}- 

In contrast, real-world networks typically have a heterogeneous connectivity pattern: Hub 
nodes with a very large degree as well as isolated nodes are simultaneously present. The 
following question then naturally arises: Does eq. (JIJ hold for such heterogeneous networks? 
To answer this question, we computed the maximum flow for each pair of nodes in random SF 
networks [10] using the MAX_FLOW algorithm [11]. Hundreds of network realizations were 
generated for given numbers of nodes N and links L, and the degree exponent 7, for which 
we were able to identify that the average of q over the node pairs that have /c m ; n as smaller 
degree, (g)fc mln , satisfies a relation that is analogous to eq. QJ: 

(?)fcmto — ck min , (2) 

for large fc m i n , with a coefficient c less than 1 [See fig-UJa)]. 

If we define the distribution of the maximum flow as P(q) = (J2 s ^t ^« s ,t,?/Es^t -*-)) with 
(■ • • ) denoting the ensemble average, it can be decomposed in terms of the smaller degree 
as P(q) = J2k P ™in{k)P(q\k), where P min (k) = {J2 S jt t S rxdn{k a ,k t },k/(E, s? tt !)) and p (l\ k ) = 
(E s ^( Sq,, t ,q £mm{fc a ,fc t },fc)/(E s #t <W{fc s ,fe t },fc> • The conditional distribution P(q\k) is sharply 
peaked around the average q = (q)k as shown in fig.^b), which implies P(q) ~ P m in{k — lie) 
with a good accuracy, where c is the coefficient appearing in eq. (J2J. -P m in(fc) is related to the 
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Fig. 1 - Maximum flow in model networks, (a) (g)fc min /fc m m versus fc m i n in SF networks with 7 = 
2.5 and different values of (k) = 2L/N. (b) Conditional distributions P(q\k) in SF networks with 
N — 1000, (k) = 1.6, and 7 = 2.5. (c) Distribution of the maximum flow P(q) for the ER graphs 
and SF networks with 7 = 2.5, both consisting of 1000 nodes and 2000 links ((fe) = 4). Each line 
represents Pmi n {q/c) with c ~ 0.97 used for the ER graphs and c ~ 0.94 for the SF networks, which 
is in good agreement with P(q) . (d) Flow efficiency c versus (fc) for the ER graphs and SF networks 
with 7 = 2.5. The statistical error is smaller than the data point size and N = 1000 in both networks. 



degree distribution Pd(k) by P m i n (k) = 2Pd{k)^ k , >k Pd{k')- Thus the asymptotic behavior 
of the degree distribution determines the large-g behavior of P(q) [fig-dc)]: 

P(q) ~ e ~( const -) 9 ln («/< fc )) j 
for the Erdos-Renyi (ER) graph that have P d (k) ~ e -kln(k/(k)) j 12 ] ; and 

P(q)~q- {2l - 1} , (4) 

for SF networks with the degree exponent 7. Recently Lopez et al. reported the same 
asymptotic behavior for the conductance distribution in complex networks [13]. 

Contrary to D > 1-dimensional regular lattices, even when k min is large, the maximum 
flow in heterogeneous networks may be very small due to the collision of connecting paths at 
some critical links. For example, in figEK a ) j ^ ne maximum flow between the node 2 and 14 is 
only 1 because every connecting path has to pass the link (3, 13). The value of c in eq. is 
thus a measure for how efficiently such collisions are avoided in the underlying path structure. 
From now on, we call the coefficient c flow efficiency, which depends on the network topology 
as shown in fig.^d). While c is higher with more links, its dependence on the degree exponent 
7 is non-trivial. The example in fig. ^d) shows that c is higher in SF networks for (k) < 2.5 
while it is higher in ER graphs otherwise. This illuminates ambivalent effects of heterogeneity 
on transport. When links are abundant, most node pairs can be connected and have as many 
connecting paths as k m - m . What matters is then to avoid collisions of those connecting paths, 
which can be achieved more efficiently in networks closer to a regular one. On the contrary, 
when links are deficient but as many pathways as possible are required between each pair of 
connected nodes, concentration of links within hub nodes is preferable to forming a chain of 
nodes all of which have degree 2. 
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Fig. 2 - (color online) Bridge, separation pair, and EBCC. (a) The links (1, 2), (8, 12) and (3, 13) are 
bridges that would increase the number of CCs if removed [15]. The network in (a) has two EBCCs 
(I) and (II) shown in (b). The pair of links {(4, 7), (5, 10)} is a separation pair that disconnects the 
EBCC (II). An example of the EBCC in a SF network is shown in (c), which has 100 nodes, 80 links, 
and 7 = 2.5. It has 5 CCs that has more than one node and the largest CC has an EBCC presented 
in (d). 

In the following we will show that the flow efficiency is related to particular topological 
properties of the network that can be described by various aspects of connectedness. One unit 
of flow can be sent from a node s to another node t if and only if a link e sv incident to s and 
a link e tw incident to t belong to the same connected component (CC) or equivalently, both 
links are connected. A CC of a graph is a maximal subgraph in which at least one connecting 
path exists between each pair of nodes. Next, the second unit of flow requires another path 
that does not share any link with the first path. This condition is fulfilled if and only if at 
least two links e sv and e sv i that are incident to s and at least two links et w and et W ' incident 
to t belong to the same edge-biconnected component (EBCC) or equivalently, there exist two 
pairs of edge-biconnected links, {e sv ,et w ) and {e sv i, et W ')- An EBCC is a maximal subgraph 
which cannot be disconnected by removing a single link. Each node of a graph either belongs 
to a unique EBCC or does not belong to any EBCC. 

One could now naively expect that k > 2 units of flow between s and t require that s and 
t belong to the same /c-edge-connected component (a fc-edge-connected graph is a maximal 
subgraph which cannot be disconnected by removing any [k — 1) links [14] and thus, a graph 
is (k — l)-edge-connected if it is fc-edge-connected). This is a sufficient condition, but not 
a necessary one for k > 2 units of flow: The nodes on k disjoint paths connecting s and 
t may have degree 2 and thus the nodes s and t may belong to different fc-edge-connected 
components even though they have k > 2 link-disjoint paths. 

It turns out that already the structure of the connected and the biconnected components 
determines the existence of flow values q s _t > 1 between two nodes s and t. Obviously for 
Qs.t > 1) s and t must belong to the same CC. For q Stt > 1, s and t should have two pairs of 
edge-biconnected links. When s and t belong to the same CC but to different EBCCs, the 
maximum flow will only be one, q s j — 1. There exist one or more bridges [15] between different 
EBCCs belonging to the same CC, which would disconnect the EBCCs if removed [fig- EI - 
Analogously if there exists a separation pair of links that disconnect s and t if removed [fig. [21, 
q s .t will not be larger than 2 even when s and t belong to the same EBCC and k m i n > 2. 

To study this relation between flow and topology statistically, we determined all CCs and 
EBCCs in a given network and defined for all node pairs (s, t) 

N c AT obc 

6 M = £nrin{/J»>, //">}, E,, t = £ min{ 5 M,^}. (5) 

71—1 771=1 
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Fig. 3 - (color online) Relation between the maximum flow and edge-biconnectedness. (a)-(c) Condi- 
tional probabilities Pt. and Pe versus kmin/ Kmin in SF networks with 7 = 2.5 and (a) (k) = 0.5, (b) 
(fc) = 1.6, and (c) (k) = 4. Here K mln is the ensemble-averaged largest value of fc m i n . Note that Pe 
approaches one for increasing average connectivity (k). (d) Plots of c, cs, m c , and m c b c versus (k) 
for SF networks with 7 = 2.5 and AT = 1000. 



Here N c (-/V e b c ) is the total number of CCs (EBCCs) in a network, f\ n ' (g\ J ) is the number 
of links incident to node i and belonging to the n-th CC (m-th EBCC). <d s j C^s,t) counts 
the number of disjoint pairs of connected (edge-biconnected) links incident to s and t. While 
/i"' 1 (/ t ) is either or k s (kt), g^ (<4 ) can take any integer between and k s (k t ) since 
each link incident to a node may belong to an EBCC or not. The deviation of q s ^ from s>t 
or E Si t is therefore a measure of the number of critical links, bridges or separation pairs. 

We computed the conditional probability Pq that q St t is equal to Sj t and Pe that q s j 
is equal to E Sit for given values of k mul in various model networks. It turns out that E Sjt is 
in a good agreement with q Sft for large values of /c m i n , manifested via values for Pe close to 
one. In SF networks with 7 = 2.5, as shown in fig.|3{a)-(c), Pe is larger than 0.5 ((fc) = 0.5), 
0.8 ((k) — 1.6), and 0.9 ((k) = 4) for a wide range of k m i n /K m i n < 1 independent of the 
value of N. K m i n denotes the ensemble-averaged largest value of fc m ; n that can be observed in 
networks of N nodes, and is given by [10] K min = (k)(-y - 2)(iV/2) 1 /( 7 " 1 V(7 - 1). Whereas 
Pe approaches one for a range of fc m ; n that broadens with increasing average connectivity (k), 
the probability Pq is very small except for the regime where A: m i n is small. The agreement of 
q S: t and E S) i for large /c m i n is observed also for other values of 7 > 2 and ER graphs (7 — * 00). 

These results elucidate non-trivial features of the paths connecting hub nodes. Most 
dangerous links that may prevent large flow between hub nodes are bridges. If two hub nodes 
belong to the same EBCC, they can send and receive a flow nearly as large as the number of 
links belonging to that EBCC. In other words, separation pairs are very rare in the EBCC to 
which hub nodes belong. The agreement between the maximum flow and the number of pairs 
of edge-biconnected links can be of importance in practical aspects as well: The algorithm to 
compute EBCCs has running time 0(N + L) [15] while the maximum flow takes 0(N 3 ) time 
in sequential machines [16]. 

The agreement of q St t and E s t allows a deeper understanding of the relation ©. Con- 
sidering that E s t is dominated by the giant EBCC, defined here as the EBCC that has a 
O(L) links, one can evaluate (E)fc min , the average of E over the node pairs having A: m i n as 
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Fig. 4 - Maximum flow for the autonomous system network of the Internet with N = 5238, L = 9993, 
and 7 = 2.2(1). (a) (g)fc min /fc m in as a function of fe m i n . It fluctuates around c = 0.79(6), which is 
consistent with the relative size of the giant EBCC m c b c — 0.80. (b) Plots of conditional probabilities 
Pe, P&, Ps, and P s versus /c min / -K" m i n with K mia ~ 449. 



their smaller degree. Consider the relative size of the giant EBCC m e bc denned as the ratio 
(Number of links in the giant EBCC)/ (Total number of links). Assuming that the links of a 
node participate in the giant EBCC statistically independently of one another, the number of 
links that belong to the giant EBCC, of s and t, denoted by gi 1 ^ and g^, respectively, follow 
Binomial distributions B{k Sl m c \ ic ) and _B(fc t ,m b c ) respectively, giving 



<£>w_ -EE 

9l=0g2=0 



in \ ( k 



mm \ / "-max 

5i / V 92 



m ebc ffl+92 



x (1 - m cbc ) fc -» +fe — -^-^mxa{ 9l ,g 2 }, (6) 

where fc max = max{fc s , k t } and (• • • ) x , y means the restricted average over the pairs of nodes 
that have min{fc s , kt} = x and max{fc s , k t } — y. Dominant contributions to the summations in 
eq. © come from the regime (51,32) (m chc k mia , m chc k max ) where min{gi,g 2 } = m chc k min . 
Thus we have 

( S )femi„ — m cbc fcmin- (7) 

Equation Q and the agreement of q St t and T, s _ t provide the basis from which eq. J2J follows. 
In accordance with eq. 0, we indeed observed that (S)fc min ~ cs fc m j n for large fc m i n , and the 
values of cs and c are compared with m c b c in fig- Eld) . Although m e bc is not exactly equal to 
cor cs, both of which are in excellent agreement, because of the correlations among different 
links participating in the giant EBCC, the deviation is so small that m c b c can be a guide to 
the value of c. We can also consider m c defined as the ratio (Number of links in the giant 
CC)/ (Total number of links), but it turns out to deviate considerably from the value of c or 
cs as shown in fig. Eld). 

Finally we have tested our findings in a real network, the Internet at the autonomous 
system (AS) level. An AS includes a set of routers following a common routing strategy and 
the Internet may be viewed as a network of the ASs [17]. Here we used the data recorded 
on a particular date (June 15, 1999) and present the results in fig. 0] The network consists 
of 5238 nodes and 9993 links ((k) ~ 3.82) and the degree exponent is 7 = 2.2(1). All edge 
capacities are set to one. While all the nodes are connected with one another (m c = 1), only 
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3251 nodes and 8006 links belong to the giant EBCC, giving m c b c — 0.80. We found that the 
relation in eq. ||2J) is observed and furthermore, the flow efficiency c = 0.79(6) is very close to 
the value of m c bc- The conditional probability P^ is shown to be very high in contrast to Pq. 
Additionally we computed the relaxed probabilities, Ps and Pq, defined as the probability 
that |1 — q Sy t/^ s ,t\ < 0.05 and the probability that |1 — q s ,t/Qs,t\ < 0.05, respectively. Ps is 
near one for most values of k m { n , but Pq differs only slightly from Pq. 

In conclusion, we studied the relation between transport, specifically the maximum flow, 
and the structural organization in complex networks. The path structure of complex net- 
works with heterogeneous connectivity is efficiently organized such that nodes with more links 
can send and receive larger amount of flow as expressed in eq. @. This may explain the 
abundance of heterogeneous connectivity pattern in complex systems that might have evolved 
towards better performance of transportation and communication among their constituents. 
The structure of edge-biconnected components determines essentially the maximum flow be- 
tween hub nodes, which therefore can be estimated in a time that is linear in the total number 
of nodes. 

* * * 

We thank H. Jeong for providing the Internet data. 
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